Estimation of Melody and Bass Lines in Musical Audio Signals
Abstract
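This paper proposes a method for estimating the pitch (fundamental frequency) of the melody and bass lines in monaural musical audio signals that contain a mixture of sounds from multiple instruments. Conventional pitch estimation and sound source separation methods could handle mixtures of at most three sounds and did not work effectively on audio signals of jazz and popular music from commercially available CDs. The proposed method does not depend on the fundamental frequency component, which cannot be extracted stably in a mixture. Instead, it finds the most predominant pitch supported by the harmonic components within an intentionally limited frequency range (the middle and high range for the melody, the low range for the bass). To do so, it models the mixture as containing the harmonic structures of all possible pitches, without assuming the number of sound sources, and uses the EM (Expectation-Maximization) algorithm to estimate how predominant each harmonic structure is relative to the others. Furthermore, a multi-agent model is introduced in which each agent tracks the temporal trajectory of a pitch, so that the most predominant and stable pitch trajectory can be obtained. Experiments with a system implementing this method confirmed that it can estimate the pitch of the melody and bass lines in real time for real-world audio signals sampled from commercially available CDs.
Keywords: pitch estimation, pitch extraction, sound source separation, EM algorithm, music understanding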
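The EM step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the tone-model shape (Gaussian harmonics on a log-frequency axis in cents, with 1/h amplitudes), the names `tone_model` and `em_f0_weights`, and all parameter values are assumptions made for illustration. The sketch treats a band-limited spectrum as a weighted mixture of harmonic structures for every candidate pitch and re-estimates only the mixture weights.

```python
import numpy as np

def tone_model(freqs_cent, f0_cent, n_harmonics=8, sigma=20.0):
    """Assumed harmonic-structure density for one candidate F0 (in cents)."""
    model = np.zeros_like(freqs_cent, dtype=float)
    for h in range(1, n_harmonics + 1):
        # The h-th harmonic lies 1200*log2(h) cents above the fundamental.
        center = f0_cent + 1200.0 * np.log2(h)
        model += (1.0 / h) * np.exp(-0.5 * ((freqs_cent - center) / sigma) ** 2)
    return model / model.sum()

def em_f0_weights(spectrum, freqs_cent, f0_candidates, n_iter=100):
    """Estimate the relative weight of each candidate F0 with EM."""
    p = spectrum / spectrum.sum()            # observed spectrum as a density
    models = np.stack([tone_model(freqs_cent, f0) for f0 in f0_candidates])
    w = np.full(len(f0_candidates), 1.0 / len(f0_candidates))
    for _ in range(n_iter):
        joint = w[:, None] * models          # shape: (candidates, bins)
        resp = joint / (joint.sum(axis=0) + 1e-12)   # E-step: responsibilities
        w = (resp * p[None, :]).sum(axis=1)          # M-step: new weights
        w /= w.sum()
    return w

# Synthetic example: two overlapping tones, the one at 4800 cents is louder.
freqs = np.arange(3000.0, 9001.0, 10.0)
candidates = np.arange(3600.0, 6001.0, 100.0)
spectrum = 0.7 * tone_model(freqs, 4800.0) + 0.3 * tone_model(freqs, 5500.0)
weights = em_f0_weights(spectrum, freqs, candidates)
best = candidates[np.argmax(weights)]        # most predominant candidate F0
```

Because the number of sources is never fixed, every candidate keeps a weight, and the predominant pitch is simply the candidate with the largest weight. Tracking that maximum over time, which the abstract assigns to the multi-agent model, is outside this sketch.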
Similar resources
F0 Estimation of Melody and Bass Lines in Real-world Musical Audio Signals
This paper describes a method for estimating the fundamental frequency (F0) of melody and bass lines in monaural audio signals containing sounds of various instruments. Most previous methods assumed mixtures of only a few sounds and had great difficulty dealing with audio signals sampled from compact discs. Our method does not rely on the unreliable F0 component and obtains the most predominant F...
A robust predominant-F0 estimation method for real-time detection of melody and bass lines in CD recordings
This paper describes a robust method for estimating the fundamental frequency (F0) of melody and bass lines in monaural real-world musical audio signals containing sounds of various instruments. Most previous F0-estimation methods had great difficulty dealing with such complex audio signals because they were designed to deal with mixtures of only a few sounds. To make it possible to estimate the...
PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals
This paper describes a real-time method, called PreFEst (Predominant-F0 Estimation method), for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Without assuming the number of sound sources, PreFEst can estimate the relative dominance of every possible harmonic structure in the input mixture. It treats the mixture as if it contains all possi...
A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals
In this paper I introduce a method, called PreFEst, for estimating the fundamental frequency (F0) of simultaneous sounds in monaural polyphonic audio signals. Most previous F0-estimation methods have had difficulty dealing with such complex audio signals because these methods were designed to deal with mixtures of only a few sounds. Without assuming the number of sound sources, PreFEst can esti...
A Predominant-F0 Estimation Method for Real-world Musical Audio Signals
In this paper we describe a robust method, called PreFEst, for estimating the fundamental frequency (F0) of melody and bass lines in monaural audio signals containing sounds of various instruments. Most previous F0-estimation methods have difficulty dealing with such complex audio signals because they are designed for mixtures of only a few sounds. Without assuming the number of sound sources, ...
A Real-time Music Scene Description System: Detecting Melody and Bass Lines in Audio Signals
This paper describes a predominant-pitch estimation method that enables us to build a real-time system detecting melody and bass lines as a subsystem of our music scene description system. The purpose of this study is to build such a real-time system that is practical from the engineering viewpoint, that gives suggestions to the modeling of music understanding, and that is useful in various appl...